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1. Introduction 

The main topic addressed in this paper can be outlined as follows: Let (W, B(W)) 
be a measurable space on which we are given two probability measures p and v. 
Assume that v is absolutely continuous with respect to p. Is there a measurable 
transformation T : W — > W such that the image of p under T is vl Of course 
at this generality the answer will be too vague to be of some interest, hence let 
us provide the space (W,B(W)) with more analytical structure in such a way 
that we can construct at least a Gaussian measure on it. 

We shall call the triple (W, H, p) an abstract Wiener space if W is a separable 
Frechet space, H is a separable Hilbert space, called the Cameron-Martin space, 
which is densely and continuously embedded in W . We also identify H with its 
continuous dual, hence the imbeddings W* H* = H <—> W are all dense 
and continuous. (W, B(W)) is equipped with a probability measure p such that 
for every e in W* , „(e,tw) has the law N(0, |e|#) under p. Although in the 
literature W is in general supposed to be a Banach space, it is more interesting 
to assume W to be a (separable) Frechet space, since such a space is a countable 
projective limit of Banach spaces, all the properties of abstract Banach- Wiener 
spaces extend immediately to our case. Another justification of this choice lies 
in the fact that neither the classical Wiener space Co (H+ , H) nor 1R IN arc 
Banach spaces (of course they are Frechet!) although we need them badly in 
stochastic analysis 1 . Let T be a measurable transformation of W. The problem 
of determining whether the induced measure v = Tp, where Tp(A) = p(T~ 1 A), 
is absolutely continuous with respect to p and to find the corresponding Radon- 
Nikodym derivative, in particular the case where T is a perturbation of identity, 
i.e., w — * T(w) = w + u(w) where u : W — > H, has been treated extensively (cf. 
[2^ and the references therein) . In this paper we consider the converse problem; 
namely, given a positive random variable L on (W, H, p) such that and E[L] = 1, 
is there a measurable transformation T on W such that (dTp/dp) — LI In 
addition to asking for any T, we will also ask for the existence of transformation 
T belonging to some particular classes which answer to the latter question. To 
be more specific we set: 

Definition 1.1. Let (W,H,p) be an abstract Wiener space and let is be a mea- 
sure on W which is absolutely continuous with respect to p, i.e. dv = Ldp, (or 
a r.v. L such that L > a.s. and E^[L] = \), then: 

1 Readers who prefer to consider the classical Wiener space may, throughout the paper, 
suppose W to be the standard ci-dimensional Wiener space Cq([0, 1], TR d ) with (a) (t,w) — » 
w(t) = Wt(w) being the Wiener process and W endowed with the supremum norm ||itf || = 
sup tg [o 1 i I jp^d and (b) The Cameron Martin space being the space of IR 1 * valued absolutely 

continuous functions h(t) where h(t) = f * h(t)dt with the norm \h\^j = \h(t)\ 2 dt. 
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a) The measure v (or the r.v. L) will be said to be realizable or representable 
if there exists a measurable transformation T such that Tfi = v. Such a T 
will be called a realization of L = dvjd[i. This is equivalent to the validity 
of the following: 

E^[goT] = E^[gL] 1 

for any nice gonW. 

b) The measure v <C fi or the corresponding Radon-Nikodym derivative will 
be said to be "realizable (or representable) by a shift" if there exists an 
H -valued random variable u (a "shift") such that for Tw = w + u(w), 
T[i = v . 

In the class (b) we will also consider several particular types of realizations: 

(bl) The shift u is adapted to some particular filtration on W. 

(b2) The transformation T is "triangular" (to be defined in section [5J. 

(b3) The shift u is the gradient of some random variable ip. 

In addition to these realization problems we will also consider two closely related 
realization problems which will be described at the end of this section. 

We turn now to an outline of the paper. In the next section we summarize 
the notions of gradient and divergence on the Wiener space and some associated 
relations. In section |31 we consider the problem of the realization by adapted 
shifts, i.e. case of (b-lj. This was introduced and treated by Ustiinel and Zakai 
cf. in Section 2.7 of |2jj. Consider first the classical setup of the Wiener space, let 
Tt to be the subsigma field induced by {wg, 9 < t} and consider the realization 
of L by Tw = w + u(w) where u is required to be a.s. in H and adapted to 
the filtration (!Ft,t G [0,1]). In the abstract Wiener space there is no natural 
filtration, however, we can adjoin to the AWS a filtration induced by a resolution 
of the identity on H (cf. e.g. [23|). In [23], among other results, it is proven that 
given L, we can find a sequence of adapted ff-valued shifts, say u n , such that 
measures induced by them, and the corresponding Radon-Nikodym derivatives 
of T n \i (w.r. to \i) L n , satisfy 

lim E[\L - L n \] = 



or 



and the supremum is taken over all finite partitions of W. 

In section0]we present the result of Fernique 0; in our context it states that 
for any v [i then there is a probability space (fl, P) and three random 
variables T, B and Z defined on this space such that T and B are W- valued, Z 
is ii- valued, T{P) = u, B{P) = fj, and T = B + Z. 

Section El represents the work of Bogachev, Koleshnikov and Medvedev 0,0] 
and is based on a transformation of fi to v by a "triangular transformation" . 
This approach yields not only the result that every measure that is absolutely 



Feyel, Ustiinel, Zakai/ Realization of positive random variables on Wiener space 173 



continuous with respect to the abstract Wiener space is realizable, it yields also 
that it is realizable by a shift without any further restrictions on L. 

Section follows the results of Feyel and Ustiinel I^.JTo|. It has its origin in 
the modern theory of optimal transportation (cf. e.g. |3l llll UtI Hil E^l i.e., 
given an absolutely continuous probability (w.r. to the Lebesgue measure) on 
IR™, it can be transformed into any probability measure (3 on 1R" by a trans- 
formation T(x) = V$(x) where $ is a convex function, and this transformation 
is optimal in a certain sense. In and 01 Feyel and Ustiinel extended these 
transportation results to the infinite dimensional setting and some of these re- 
sults are briefly explained here. In particular, the case of Wiener measure can be 
formulated as follows: if a probability measure v on W is at finite Wasserstein 
distance from the Wiener measure fi, where the Wasserstein distance is defined 
with respect to the Hilbert norm of the Cameron-Martin space H (which is 
not continuous on W), then there exists a Wiener function tp which is Sobolcv 
differentiable such that the transformation defined by 

T(w) = w + Vip(w) 

maps (i to v (note that there is no necessity for the absolute continuity of v w.r. 
to //). We indicate some applications of these results to the polar factorization 
of the absolutely continuous perturbations of identity. We further extend these 
results by defining the notion of measures which are at locally finite Wasserstein 
distance from each other and prove that if at least one of them is a spread 
measure, then the other one can be written as the image of the spread one under 
a locally cyclically monotone map of the form of a perturbation of identity. 

Section [7| deals with the following problem. Let y — u(w') + w where w 
and w' represent independent Wiener processes where u(w') takes values in the 
Cameron-Martin space, i.e. u represent a "signal" and w is an additive "noise" 
which is independent of the signal. Let v y denote the measure induced by y 
on W and let L(w) denote the Radon- Nikodym derivative (or likelihood ratio) 
dvy/d^i. This model is relevant to many communication and information theory 
problems which makes it interesting to consider the converse problem: given a 
positive random variable L(w) on (W, H,fj) satisfying E[L] = 1, under what 
conditions does there exist a representation of "signal plus independent noise" 
realization such that the Radon-Nikodym derivative of v y with respect to ji is 
L. This problem was first considered in [2^ and further clarified in |l6j |. 
Remark: A problem which is also related to the problems considered in this 
paper is that of the realization of the random variable L(w) — 1, i.e. the class of 
measure preserving transformation on the Wiener space. This problem is treated 
in [2j| and [3] and will not be discussed in this paper. 

2. Preliminaries 

The gradient: Let (W, H, fi) be an AWS and let e,-, i = 1, 2, . . . be a sequence of 
elements in W* . Assume that the image of in H form a complete orthonormal 
base in H. Let /(xi, . . . ,x n ) be a smooth function on l n and denote by f[ 
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the partial derivative of / with respect to the i-th coordinate. We will denote 
(e, w) by Se. 

For cylindrical smooth random variables F(w) — f(Se±, . . . , 8e n ), define V^F = 
dF(w+eh) ^ Therefore we set the following: V/jF = (VF, h) where VF, the 
gradient, is fFvalued. For F(w) = Se, VF = e, and 

n 

^F = Y J f'Se 1 ,...,5e n )-e i . (2.1) 

It can be shown that this definition is closable in L p (/j,) for any p > 1. Hence it 
can be extended to a wider class of functions. We will restrict ourselves to p = 2, 
consequently the domain of the V operation can be extended to all functions 
F(w) for which there exists a sequence of smooth cylindrical functions F m such 
that F m — > F in L 2 and VF TO is Cauchy in L 2 (fJ,, H). In this case set \7F to be 
the Li{n, H) limit of VF m . This class of r.v. will be denoted ©2,1- It is a closed 
linear space under the norm 

\\F\\ 2A = EUF) 2 +El\VF\ 2 H . (2.2) 

Similarly let K be an Hilbert space and ki, k%, . . . a complete orthonormal base 
in K. Let (p be the smooth K- valued function ip = ^2j =1 fj(Sei, . . . ,6e n )kj 
define 

rn n 
J'=l »=1 

and denote by H>2,i(K) the completion of V</3 under the norm 

yh,i = Ei(\ip\ 2 K + \Vip\ 2 HxK ) . 

For the case where W = Cq[0, 1], VF is of the form VF = J Q a s ds. We will 
denote a s by D S F and therefore for any h — J Q h s ds 

(VF, h) H — f D S F ■ h s ds . (2.3) 
Jo 

The divergence (the Skorohod integral): Let v(x),x € M n take values in R„, 
u(x) = X^i v i( x )Pi> where the pi are orthonormal vectors in R n . Recall, first the 
following "integration by parts formula" : 

(v(x), VF(x))dx = -J F(x)divvdx, (2.4) 

where div is the divergence: div v = J2i f^r- Note that equation Ij2.4|l deals with 
integration with respect to the Lebesgue measure on R„. Here we are looking 
for an analog of the divergence operation on R„ which will yield an integration 
by parts formula with respect to the Wiener measure. 
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Definition 2.1. Let u(w) be an H-valued r.v. in (W,H,fj,), u will be said to 

2 

be in dom2 S if E\u(w)\ < oo and there exists a r.v. say Su such that for all 
smooth functionals f(Se\, . . . , Se n ) and all n the "integration by parts" relation 



E 



hf,u(wj) = E{f-Su) (2.5) 

is satisfied. Su is called the divergence or Skorohod integral. 

A necessary and sufficient condition for a square integrable u(w) to be in 
doni2 S is that for some 7 = 7(1*), 

E(u(w),Vf) H \ <jEif(w) 

for all smooth /. For non-random h e W* , 5h — (h,w), setting / = 1 in (|2.5|l 
yields that ESh = 0. It can be shown that under proper restrictions 

5(f{w)u(wj) - f(w)Su - (v/,uH) fl . (2.6) 
2 

Consequently, if E\u\ H < 00, and Vu is of trace class then 

(u, w) — Su + trace Vu . (2.7) 

w* 'w 

Among the interesting facts about the divergence operator, let us also note that 
for the classical Brownian motion and if (u(w))(-) — J u s (w)ds where ii s (w) is 
adapted and square integrable then Su coincides with the Ito integral i.e. 

Su = u s (w)dw s . (2-8) 
Jo 

Notation: Let u(w) be ff-valued and in the domain of the divergence. We will 
use p{u) to denote the exponential function 

p{u) :=exp(su~hu\ 2 H ) . (2.9) 



3. Realization of positive functionals by adapted shifts 

Let (W, H, fi) be the classical 1-dimcnsional Wiener Process. The expectation 
operator E and "a.s." will be with respect to the measure /i. Let v be a prob- 
ability measure on W and v <C /i- Set L(w) — dv/d/i. Then by the integral 
representation for L £ L 1 ^) 

L{w) = 1- f i(w)dw s (3.1) 
Jo 
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where £. is adapted, and J* i 2 (w) ds < oo a.s. Let L t (w) — E(L{w)\T t ) then 

L t (w) = 1 - / i s (w)dw s . 
Jo 



(3.2) 



Set 

a s (w) =L-\w)t(w) ifL s H^0 

= i£L a (w) = 

Further assume that f Q a 2 {w)ds < oo a.s. (a sufficient condition for this is 
v <~ n). Then L t {w) solves the equation 

L t {w) = l- [ L s (w)a s (w)dw s (3.3) 
Jo 



and 

ft 



L t (w) = cxp ^- y a s («;) d«7 S - i y 



dj(w)ds . (3.4) 



The relation between the shift Tw = w + J Q d s (iu)ds and L is given by the 
Girsanov theorem as 

T{L l i)= i i. (3.5) 

We have also the following result which assures the realization of v — L ■ /x: 

Proposition 3.1. Let u, L, a and T be as defined above. Assume that the process 
w — > y(w) = (yt(w),t G [0, 1]) solves the equation 

y t {w)=w t - / a s (y)ds (3.6) 
Jo 

in the strong sense and that 

/ c\ 2 {y)ds < oo a.s. (3-7) 
Jo 

Let S : W — > be the map defined by y(w) = S(w), i.e., Sw = w+ J Q a s (Sw)ds. 
Then 

(i) S is the right inverse of T : 

ToS = I w 

almost surely and S realizes v, i.e., S/j, = v and Sfi ~ \i. In fact T is 
almost surely invertible and S is its (both sided) inverse, 
(ii) Let M be defined as 



1 



M = exp( / a s (y)dw s - | / \a s (y)\ 2 ds ) , 
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then 
and 

LMoT = l (3.8) 

li-almost surely. 

Proof. It is clear from (|3.6[1 that ToS(w) — w almost surely. Moreover, it follows 
from the hypothesis l|3.7[l , the random variable 

M = exp(J a s (y)dw s - i J \a s (y)\ 2 ds 

is almost surely finite. Using the usual stopping time technique and Fatou 
Lemma we have -E[M] < 1. Moreover, again using the stopping technique and 
the Jessen theorem, cf., 23], Theorem 1.2.1, it follows that 

E[f o S M] = E[f] , 

for any / 6 Cb(W). Since M > almost surely, Sfi is equivalent to fi. Moreover, 
we have 

LoSM = l, (3.9) 
almost surely. To see that Sfi = v, we have, for any nice function / on W, 

fLdfi = [ foSLoSMdfi 
w Jw 



f o Sdii, 

w 

by the relation l|3.9(l . To show that S o T = Iw almost surely, note first that S 
is almost surely subjective: in fact, using the Girsanov theorem, we get 



ti(S(W)) = / l 8 (W)dn 



ls(W) ° S(w) M(w)dfj,(w) 
u 

Md(i = l. 
w 



Hence S(W) — W almost surely. Moreover, on the set W\ — {w G W : To 
S(w) = w} the map S is injective and (t(Wi) — 1 and this completes the proof 
of (i). For the proof of (ii), the first part follows from (i) and from the Girsanov 
theorem: 



/ oT dfi = J f o T o S Mdfi 
fMdfi, 
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for any nice / on W. For the second part we know already that L o S M = 1 
and that Tfi ~ /i since M > ^i-p.s. Hence it suffices to compose this identity 
with T. □ 
In the converse direction we have the following result which is based on the 
assumption invertibility of adapted absolutely continuous shift. 

Proposition 3.2. Assume that v is realizable by 

Tw = w + / u s (w)ds (3.10) 



where u. is adapted, u 2 (w)ds < oo a.s., v = T/i <C /i and L = dv j dii. Further 
assume thatT is right invertible (TT^ 1 = Iw a.s.) and \u s (T~ 1 w)\ 2 ds < oo. 
Then T satisfies the equation ( i|3.6[l ) with a(w) — —u(T~ 1 w) and 

L(w)=exp(-J^ u s (T~ 1 w)dw s -^ J MT^uOlW) , (3.11) 
in particular v ~ /i. 

Proof. Since T is right invertible, by (|3.1Q(I we have 

T~ 1 w = w- I u s {T- 1 w)ds. (3.12) 
Jo 

By Theorem 2.4.1 of [23| (T -1 )^ < [i. Moreover, since J Q \u s \ 2 ds < oo a.s., it 
follows by Theorem 2.4.2 of [23 that Tfi ~ fj,. Set A to be 

A(w) = exp ( { u s (w)dw s - \ ( \u s \ 2 (w)ds 



by the Girsanov theorem (cf. 23]) 

dfx 



A=( d ^o T y = (LoTr 



Hence L = (A o J 1 " 1 )" 1 and 

L = cxp(— / u s oT~ 1 dw s — — [ \u s o T^ 1 \ 2 ds 



□ 

A complete characterization on the representability of Radon-Nikodym deriva- 
tives by adapted shifts is not known at present. We have, however the following 
result (cf. section 2.7 of |2^| for the proof): 

Theorem 3.3. The set of measures that are realizable by adapted shifts is dense 
in the space of measures which are absolutely continuous with respect of the 
Wiener measure under the total variation norm, i.e. if on an abstract Wiener 
space v -C then there exists a sequence of invertible and adapted shifts T n 
such that dT n fi/d[i converges to dv/dp: in the L 1 norm. 
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In the rest of this section we shall make some further observations about the 
problem. First let us a necessary and sufficient condition for the representability 
of a positive random variable: 

Proposition 3.4. Let U — Iw + u, where u : W — > H is adapted. Denote 
p(—6u) (cf. equations 1)2. 8J1 . (|2.9J) ) by A and assume that E[A] = 1. Then Up — 
L p if and only if 

E[A\U] L o U = 1 

almost surely. 

Proof. Necessity: from the Girsanov theorem, we have, for any / 6 Cb(W), 

E[f] = E[f o UK] = E[f oUE[A\U]] , 
note that E[A\U] = 6 o U for some measurable 9 : W — ► H. Hence we get 

E[f)=E[f6L]. 

This implies that 9 L — 1 almost surely, hence 9 o U L o U = E[K\U] L o U = 1 
almost surely. 
Sufficiency: We have 

E[foU] = E[f oU LoU E[A\U]\ 
= E[f oU LoU A] 
= E[fL], 

which completes the proof. □ 

Proposition 3.5. Assume that L is given by p(—6v), where v : W — > H is such 
that v is adapted, where v(t,w) — J Q v s (w)ds, t G [0,1] and that the mapping 
h — > v(w+h) is fi-almost surely continuous and compact on the Cameron- Martin 
space H . Finally assume that 

y \v(w + h)\ H 

hm sup — = a(w) , 

\h\ H ^oc l^lff 

such that < a(w) < 1 fi-almost surely. Then w — > w + v(w), denoted as V(w), 
has a left inverse Ui — Iw + v>, where u is also adapted and Ui realizes L. In 
fact Ui is also a right inverse almost surely. 

Proof. The hypothesis about v imply that the mapping V has a left inverse 
Ui thanks to Leray-Schauder degree theorem (cf.|23, p. 247, Theorem 9.4.1 
and Remark 9.4.4). It is clear that Ui is of the form I\y + u, with ii adapted. 
Consequently, using the Girsanov theorem, we get, for any / € Cb(W), 

E[f L] = E[foUioVp(-Sv)] 
= E[foU). 
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To show that Ui is also an almost sure right inverse let us remark that it suffices 
to prove the subjectivity of V since it is already injective almost surely by 
the existence of a left inverse. To show the subjectivity we need to prove that 
V(W) = W almost surely. First the fact that V(W) is measurable with respect 
to the universal completion of B(W) is given in Theorem 4.2.1 of [2j|. It then 
follows from the Girsanov theorem that 

H(V(W)) = E[l v{w) oVL] 
= E[L]=1, 

since ly(w) V = 1. □ 
We have also the following result: 

Proposition 3.6. Assume that L = p(—5v) with v adapted and E[L] — 1. Let 

V = Iw + v and represent dVfi/dfj, by p{—5u) and let U = Iw + u. Then U o V 
is a rotation, i.e., U o V(fi) = p, consequently the stochastic process (t,w) — > 
U o V(t, w) is a Wiener process under pi, with respect to its own filtration. If it 
is also a martingale with respect to the canonical filtration, then U o V = Iw 
almost surely. In other words U is a left inverse of V and it realizes L. In fact 
U is also right inverse [i-almost surely. 

Proof. For any / G Cb(W), we have 

E[foUoV] = E[foUp(-Su)} 
= E[f], 

from the Girsanov theorem and this shows that Uo V is a rotation. Consequently 
it can be written as 

UoV = B, 

where B is a Wiener process with respect to its own filtration. Besides, writing 
in detail the above relation, we get 

B(w) = V(w) + u o V(w) 

— w + v(w) + u o V(w) . 

If B is also a (continuous) martingale with respect to the canonical filtration, 
then the adapted and finite variation process v + u o V becomes a martingale, 
hence it is almost surely zero and this implies that U o V = Iw almost surely. 
To complete the proof of realization of L by U it suffices to reason as in the 
proof of Proposition 13. 51 The proof of the fact that U is also a right inverse a.s. 
is also similar to that of Proposition 13. 51 □ 

Proposition 3.7. Assume that L = p(—8v) with E[L] = 1 and v adapted. Let 

V = Iw + v assume also that dVp/dp = p{—8u) and let U = Iw + u. If L is 
a (V) -measurable then V is almost surely invertible and its inverse is U . 
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Proof. From the Girsanov theorem, we have, for any / 6 Cb(W), 
E[f oV p{-5u)oV L] =E[fp(-6u)] = E[f o V] . 

Hence we get 

p{-5u) o VE[L\a(V)] = 1 
almost surely. The hypothesis implies that 

p(-Su) o VL = 1 

almost surely. Hence we have 

exp (~6uoV- ^\uo V\ 2 H ^j exp f-Sv- ^\v\%\ = 1 (3.13) 
almost surely. Note that 

Su o V = Su + (u o V, v)h ■ 
From the equation H3.13|l we get 

= Su+ (uoV,v) H +8v + ^(\uoV\ 2 h + \v\%) 
= S(u + v) + ^\uoV + v\ 2 H . 

Taking the expectation of both sides we obtain 

u o V + v = 

almost surely, i.e., Iw = Iw + v + uoV = UoV almost surely, which means 
that U is a left inverse of V. One can show that it is also a right inverse as we 
have already done in the preceding proofs. □ 



4. The (weak) representability of positive random variables via 
perturbations along the Cameron-Martin space 

In this section we present the following result of Fernique Q 

Theorem 4.1. Let (W,H,pi) be an abstract Wiener space. Let v <C p, then 
there exist a probability space (f2, TL, P) and three random variables T, B and Z 
defined on it such that T, B : Q — > W and Z : Q — > H such that T(P) = v , 
B(P) = p and that T = B + Z. 

The results of the following lemma are not new fcf. [2lll2^ . l23j and the references 
there), we give a quick proof for the sake of completeness: 
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Lemma 4.1. Suppose that F : W <— > TR is a measurable random variable satis- 
fying 

\F(w + h)-F(w)\<c\h\ H , (4.1) 

almost surely, for any h £ H , where c > is a fixed constant. Then F belongs 
to B Pi i for any p > 1 and \7F £ L°°(/i, H). Moreover we have 



V{\F\ > A} < 2 exp ■ 



(X-E[F}) 2 



for any A > 0. 



Proof. We shall give the proof in the frame of a classical Wiener space, the 
abstract Wiener space can be reduced to this case as it is explained in Chapter 
2 of [1^. Remark that there is no information about the integrability of F. 
In fact this is a consequence of the hypothesis (|4.1fl as explained below: Let 
F n — \F\ A n, n £ IN. A simple calculation shows that 

\F n (w + h) -F n (w)\ <c\h\ H , 

hence F n £ D Pi i for any p > 1 and |VF„| < c almost surely. We have from the 
Ito-Clark formula (cf.j^l), 



F n = E[F n ] + f E[D s F n \T s ]dw Sl 
Jo 



where D S F is as defined by From the definition of the stochastic integral, 

we have 



E 



1 , 
E[D s F n \J 7 s )dw. 



= E 



< E 



\E[D s F n \T a }\ 2 ds 



\D s F n \ 2 ds 



= E[\VF n \ 2 ] 
< c 2 . 

Since F n converges to \F\ in probability, and the stochastic integral is bounded 
in L 2 (/i), by taking the difference, we see that (E[F n ],n £ IN) is a sequence 
of (degenerate) random variables bounded in the space of random variables 
under the topology of convergence in probability, denoted by L°(/z). Therefore 
sup n n{E[F n ] > — ► as t — > 00. Hence lim„ E[F n ] = £^[1^1] is finite. Now 
we apply Fatou's lemma to obtain that F £ L 2 (/j,). Since the distributional 
derivative of F is a bounded random variable, F £ B Pi i, for any p > 1. From 
the above lines, replacing F by F — E\F\, we can suppose that E[F] = 0. Let 
(ei) G if be a complete, orthonormal basis of H . Define V n = a{6ei, . . . , Se n } 
and let ip n = E[Pj_ / n F\V n ], where Pt denotes the Ornstein-Uhlenbeck semi- 
group on W . Then, from Doob's Lemma, 



fn = /n(<5ei, . . . , Se n ). 
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Note that, since /„ G f] k W Pj fc(IR™, fi n ), the Sobolcv embedding theorem im- 
plies that after a modification on a set of null Lcbcsguc measure, f n Ccin be 
chosen in C°°(JR n ). Let (B t ; t G [0, 1]) be an ET-valued Brownian motion. Then 

l*{\<p n \>c} - P{\f n (B 1 )\>c} 

< P{ sup \E[f n {B 1 )\B t ]\>c} 

t€[0,l] 

= P{ sup \Qi- t f n {Bt)\>c}, 

t€[0,l] 

where P is the canonical Wiener measure on C([0, l],]R n ) and Q t is the heat 
kernel associated to (B t ), i.e. 

Q t (x,A) = P{B t + xeA}. 

From the Ito formula, we have 

Qi- t MB t ) = Qifn(Bo) + f (DQ 1 _ s f n (B s ),dB s ). 

Jo 

By definition 

Qifn(Ba) = Qi/ n (0)= f fn(y)-Qi(0,dy) 




= E [E[P 1/n F\V n ]] 

= E [P 1/n F] = E[F] = . 

Moreover we have DQ t f — QtDf, hence 

Qi-tfn(B t ) = [ (Qi-,Df n (B,),dB a ) = M? . 
Jo 

The Doob-Meyer process ((M n ,M n ) t ,t £ IR+) of the martingale M™ can be 
controlled as 

(M n 1 M n ) t = ( \DQi-.f n (B.)\ 2 d8 
Jo 

< I \\D.fn\\ 2 Cb d S = t\\Vf n \\ 2 Cb 

Jo 

= t\\V.f n \\ L ~ (Mn) 

< t\\VF\\ 2 L ^ H) . 

Hence from the exponential Doob inequality, we obtain 

P | sup |Qi_ t /„(B t )| > A 1 < 2exp J - f 2 1 . 

[te[o,i] J { 2 II v ^IIl~ (aJ: h) J 
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Consequently 



n{\(fi n \ > A} < 2exp ■ 



Since ip n — > ip in probability the proof is completed. □ 
The following proposition will play the intermediate role in proof of Theorem l4.ll 

Proposition 4.2. Let fi be the standard Gaussian measure on 1R" and let v 

be another probability on 1R™ such that v -C [i and that the Radon- Nikodym 
derivative is essentially bounded by a constant A < oo. Then there exist 
a Gaussian random variable B with values in TR n , and an 1R" -valued random 
variable Z, both defined on (fl,H.,P), satisfying 



E\\Z J -A 

and the probability measure on TR n induced by T = B + Z is v. 

Proof. For the proof we shall use basically the Kantorovitch-Rubinstein theo- 
rem: Let g : IR n — > H be Lipschitz with Lipschitz constant 1. By Lemma [4. II 
and denoting by E^, E v the expectation with respect to the /i and v measures 
respectively: 

//{ g-E^g] > t} < 2exp- 

Therefore 2 

i/jlp-^^l >t} < Aexp-*-. (4.2) 

-t 2 

Note that if for some r.v. Y it holds that ip(t) = Prob{|F| > t} < Ae~ then, 
from the Fubini theorem 



e ~dt = A 



2 



E[\Y\] = / tp(t)dt < A 
Jo 

Therefore by ljO]l 

em- em 



< E v 



9 -EM 



(4.3) 



Define, for any two measures a and j3 on W 

K(a, fS) — sup / gda — I gdp 

llflllLio<l J J 



where the supremum is over all Lipschitz functions with Lipschitz constant 1. 
From l|4.3|l 

(4.4) 



KM < A \l2 
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Now let N((i, v) be the Rubinstein distance between \x and v: 



N(ji, v) = inf / \x-y\ w M(dx,dy) 

JWxW 



(4.5) 



where the infinum is taken over the set £ which consists of all the probability 
measures M on W X W for which M(dx, W) — fx(dx) and M(W,dy) = v{dy). 
Note that the set £ is compact w.r.to the weak topology of the measures on 
W xW and the integral at the r.h.s. of (|4.5|1 is lower semicontinuous under this 
topology. Hence there is at least one measure Mq which realizes the infimum 
provided that the infimum is finite. Moreover, by the Kantorovitch-Rubinstein 
theorem (cf. e.g. 0, 11.8.2]), K([i,v) = N{^jl,v) and this implies the finiteness 
of N(n, v) hence also the existence of Mo(dx, dy) on W x W with the prescribed 
properties. Letting Q = W x W, P = M , B = JX X , T = n 2 and Z = T - B, 
where Hi, i = 1,2 are the projection maps from W x W to W, we see that 
T(P) = v, B(P) = (J, and that T = B + Z. □ 
Proof of Theorem 14.11 assume first that L = ^ < A < oo. Let (e,, i > 1) C 
W* be a complete orthonormal basis of H, set 



ir N {w) = w N 



N 



1 



(w,ei)ei 



and let denote the measures on IR^ which are the images of [i and v 

under 7r/\r. Let TV and Bn be as in the proof of the proposition, then (TV, N > 1) 
and (Bn,N > 1) converge in law and by (|4.4() 



T 



N 



B 



N 



H 



It follows by weak convergence and the Skorohod representation theorem (cf. 
e.g. theorem 7.1.4. and sections 11.5, 11.7 of 6J), there exists a probability space 
(fi, H, P) and that there exists a sequence of random variables (Tj^, B' N , N G IN) 
which converges a.s. to (T',B') such that the laws of T' and B' under the 
probability P are ^ and /i respectively and moreover, by Fatou's lemma and 
by the lower semi continuity of the Cameron-Martin norm with respect to the 
topology of W, 

rjr 



Ep 



T' - B' 



H 



< A 



In order to complete the proof it remains to remove the restriction about the 
boundedness of the density L. Let A — 0, A n = 2 n_1 and W n — {w : L{w) G 
[A n -i,A n )}. Set L n (w) = l Wn L(w). Set 



if / L n d[i > 



dji otherwise 
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Applying the results for a, v n as above yields, for any n > 1, (T",B'^) denned 
on (Q n , Hn,P n ) such that T^(P n ) = v n and B^(P n ) = u with 



Ep n 



H. 



< 



J L n da 



T' r [ K 

if / L n d\i > 0. Let f2 be the disjoint union of (O n , n > 1) and define 
Define, for lu G f2„ 

then T"(P) = v and B"(P) = u and Z" = T" - B" e H almost surely. 



□ 



5. Realization by triangular transformations 
5.1. Triangular transformations 

A map T : IR™ — > IR™ is said to be triangular if 



T 2 (x 1 ,x 2 ) 



\Pn {x\ , . . . , X n ^j y 



where Tj : IR* — > IR . A triangular map is called increasing if for every i = 
1, . . . , n, Xi — > Ti(xi, . . . ,Xi) is non decreasing. 

Proposition 5.1. Let /x and t/ 6e probability measures on IR™. Assume that [i 
and v are absolutely continuous with respect to the Lebesgue measure on IR™, 
then there exists an increasing triangular transformation T on IR™ such that 

Ta = v . 

Proof. We proceed by induction on dimension: (a) Consider first the case n = 1. 
Let F tl (x) and F v (x), x g H 1 denote the distribution functions of a and v 
respectively. Assume, that F v is strictly increasing, then for every x G IR 1 
there is a unique y £ IR 1 such that F /Jj (x) = F u {y). Set T fl _ v {x) = y then 
FviT^vX) = F^(x). Since F v is strictly increasing, F v is invertible and 



T^x = F-\F^x) 



(5.1) 



transforms a into v and T Mi „ is strictly increasing. If F v is not strictly increasing, 
set 

F~\y) := inf (j3:F v (P)>y). 
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Note that with this interpretation of F~ , T Mj „ is increasing and equation (|5.1|l 
proves the proposition for n = 1. 

(b) Consider now the case n = 2, let <p{xi,X2) and ip(xi,X2) denote the proba- 
bility density with respect to the Lebesgue measure of \i and v respectively. Let 
ip Xl {x2) and ip Xl (x2) denote the regular probability densities of x-i conditioned 
on xi, i.e. 

(p(x 1 ,x 2 ) = ipi(x 1 )ip Xl (x 2 ) 

and 

ip(xi,x 2 ) = tpi(xi)~ip Xl (x 2 ) ■ 

Let T\ : K 1 — > H 1 denote the transformation on H 1 transforming c?/^i = 
ipi(xi)dxi to the measure = ip(yi)dyi as constructed in part (a). Next, 
let T<2{xi,X2) be the map transforming, for fixed x\, the measure ip Xl (x2)dx2 to 
the measure ip Xl (x 2 )dx2- Then 

is an increasing triangular mapping and T\x = v. 

(c) For n > 2 the result follows by induction using the conditional density of x n 
given xi,...,x n -i. □ 
It follows from the proof by induction that 

Corollary 5.2. Provosition I5.il holds for H°°, provided that the projections 
of the measures \x and v to TR n are absolutely continuous with respect to the 
Lebesgue measure for any n > 1. 



5.2. The realization of a Radon- Nikodym derivative by a triangular 
transformation: 



Let (W, H, /x) be an AWS. Let (e 2 ;, i > 1) C W* be an orthonormal basis 2 for H 
By the Ito-Nisio theorem (cf. e.g. [l^ or [H) ^ x 







[15J or |2 

as N — > oo. Let f <C /x then as shown in the previous section we can construct 
an increasing triangular transformation T: 



Tw = y^Tj{w)ej 

i 

TiW = gA ,{ei,w) ie^w) ) 



(5.2) 



such that the measure induced by Tw on W is ia This yields a measurable 
transformation which induces v and yields another proof to the result of Section 
0] that even if v <C /i then z/ is realizable. 



2 We do not distinguish £ W* from its image in if there is no confusion. 
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5.3. The entropy associated with a triangular transformation: 

Let n denote the standard Gaussian measure on IR™ and let T denote a C 1 
diffeomorphic map on IR™. By the change of variables formula, it holds that for 
every g e C h (IR n ) 

/ g o T(x)\A(x)\du(x) = / g{x)d[i{x) (5-3) 
where A is given as follows: set T(x) = x + u(x) then 

1 



A(.t) =det(7 + Vu)ejq)|-(«(x),x)iRr. - -\u{x)\^ j (5.4) 

(cf. e.g. [13 )• The Carleman-Fredholm determinant of an n x n matrix 1^ + A, 
det 2 (I]r*» + A), is defined as 

det 2 (/]R» + A) = det(7 R n + A)e" traceyl 

and A can be rewritten as 

A = dct 2 (7 + Vu) exp | trace Vu — (u(x),x) — ^\u(x)\^ 1 (5.5) 



and by fTfy : 



A = dct 2 (I + Vu) exp <j 6u - -|u(a;)| 2 } ■ (5.6) 



The reason of rewriting A with the modified Carleman-Fredholm 3 determinant 
lies in the fact that det2(/+A) is well-defined for the Hilbert-Schmidt operators, 
in fact A — > det 2 (/ + A) is analytic on the space of Hilbert-Schmidt operators 
on an infinite dimensional Hilbert space, while A — > det(J + A) is defined only 
for the nuclear (or trace class) operators. Assume now that the measure induced 
by T is absolutely continuous with respect to /z, T\x fi, then 



g o T(x)djji(x) = / f(x)L(x)d/j,(x) (5-7) 
ir" Jm n 

where L is the Radon-Nikodym derivative. Hence comparing (|5.7() with (|5.3|l 
and recalling that T is invertible we have 

L{x) = AiT^x) 1 (5.8) 
where A is as defined by (|5.3I) . 

Let L be the Radon-Nikodym derivative of v with respect to [i and assume that 
LlogL G L 1 ^!) set L{x)\ogL(x) — whenever L(x) = 0. The relative entropy 
H(p\u), of v with respect to fi is defined as 

H(v\n) = Epilog L) = E v (logL) . 



3 cf. Chapter 3 of 23] for details. 
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Proposition 5.3. Let T be a C 1 and invertible, triangular and increasing map 
on IR", let /i be the standard Gaussian measure on IR™, assume that T\i <C [i 
and L = dT\ijd[i satisfies LlogL 6 Then, setting T(x) = x + u(x), it 

holds that 



\E v [\u\ 2 ]<H{v\n). 
Proof. By (JEEJ) and by (£3 and then JESJ 



(5.9) 



IR" 



L(x) log L(x)ci/Lt(x) = / L(x)log A(T _1 x) e^(x) 



— log \A(x)\da(x) 
/ (log det 2 (/iR" + Vu(x)) 



+ trace Vu(x) — (w(x), x) — — [u(x)| j dfi(x) . 

Therefore we get the simple form for Tt(is\fi): 

[LlogL] = ^ M [|u| a ] - [log |det a (J + Vu)|] 

= ^[\u\ 2 ]+E^[logdet 2 (I + Vu)] 

where the last line follows since det 2 (/ + Vu) £ [0, 1] almost surely as explained 
below: Tx = x + u(x) is triangular and increasing, therefore Vit is a triangular 
matrix whose diagonal elements are all non-negative. Hence setting du ^ x ^ = 
a i > 0, yields 



det 2 (7 + Vu) = JJ(l + ai)e 



Since for a > 0, < (1 + a)e~ Q < 1 it follows that < det 2 (J + Vu) < 1, 
and (|5.9|l follows. □ 



5-4- Tfte realization by H -valued shifts 

Theorem 5.4. Let (W,H,fi) be an abstract Wiener space, define a probability 
measure v on (W, B(W)) as dv = Ld/i where L > with E[L] = 1. T/ien i/iere 
exist a measurable TonW such that Tfi = f and fi-almost surely Tw — w 6 H , 
j. e. T is a perturbation of identity. 

Proof. Assume first that L logL e L 1 ^). Let (e*, z > 1) be as in subsection 15. 21 
let (j. n and v n be the images of probability measures a and v induced on IR™ by 
the map w — > 7r n (w) = ((ex, ttf) , . . . , (e n , iu)). Then i/„ <C set /„ = dv n jd\i n 
then Z n o 7r„ = L„ is the conditional expectation of L conditioned on the u-field 
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generated by w — > ir n (w) = ((ei,tu), . . . , (e n ,w)). Since s — * slogs is a convex 
function on IR + , it follows by Jensen's inequality that 

E„[L n logi„] < ^[ilogi] < CO. 

Let £ n be the triangular map such that t n fi n = u n , define T n = t n o 7r n , then 
from the construction of subsection 15.21 lim„ T n = T almost surely. Setting 
u n (w) = T n (w) — W, we have from Proposition 15 . 31 that for all n 

sup£ M [\u n \ 2 H ]< 2E^[LlogL] 

n 

hence by Fatou's lemma E[\Tw — w\ 2 H ] < oo. 

In order to remove the restriction LlogL £ Li(/i) we proceed as follows: parti- 
tion the space W into sets E m = {w : m < L{w) < to+ 1}, m G IN, avoiding sets 
on which u(E m ) = 0. Set u m {A) = v{A n E m ) for all Borel sets A in W. Next 
partition the real line by intervals D m such that ^i(D m ) = v(E m ) where fii is 
the standard 1-dimensional Gaussian measure on Mei. Consider the decompo- 
sition n = ii\ <8> jjl where \i! is the (Gaussian) measure defined as ~ 7r i)(/ i )- 
Define \i\^ m on the real line as n\, m {B) = ^{B n D m ) for B 6 B(JR). Then 

tn 

Define t m : IRei — > Hei such that t TO (/ii, m ) = v{E\)n\. Then (t m x iivOG^lim® 
/i') = v(E m ) fi. Transform now is(E m )fi in to ^ m as in the first part of this 
subsection with a triangular transformation, say U m and define 

T m — U m o (t m x 

where W' = 7T 1 " 1 {0}. Define finally T as to be T m on the set D rn x W'. Note 
that T m (w) — w is a.s. H- valued. Since for mi 7^ rri2, the domains of T mi and 
T m2 are disjoint and so are their corresponding ranges since they are contained 
in the supports of v mi and z/™ 2 respectively. Consequently (T m ,m > 1) yields 
a triangular transformation transforming fi to v and such that Tw — w in a.s. 
in the Cameron-Martin space. □ 

6. The realization by gradient shifts 

Let W be a separable Frechet space with its Borel sigma algebra B(W) and 
assume that there is a separable Hilbert space H which is injected densely and 
continuously into W, thus the topology of H is, in general, stronger than the 
topology induced by W. The cost function c : W x W — > 1R+ U {00} is defined 
as 

c(x,y) = \x- y\% , 

we suppose that c(x, y) = 00 if x ~ y does not belong to H. Clearly, this choice 
of the function c is not arbitrary, in fact it is closely related to Ito Calculus, 
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hence also to the problems originating from Physics, quantum chemistry, large 
deviations, etc. Since for all the interesting measures on W, the Cameron-Martin 
space is a negligible set, the cost function will be infinity very frequently. Let 
S(p, v) denote the set of probability measures on W x W with given marginals 
p and v. It is a convex, compact set under the weak topology a(E, CbiW x W)). 
The problem of Monge consists of finding a measurable map T ; W — > W, called 
the optimal transport of p to i.e., T p — v which minimizes the total cost 



between all the maps U : W — > W such that Up = v. On the other hand the 
Monge-Kantorovitch problem consists of finding a measure on W x W, which 
minimizes the function 9 — > J (9), defined by 



where 9 runs in E(/?, v). Note that inf{J(0) : 9 £ E(p, v)} is the square of 
Wasserstein metric v) with respect to the Cameron-Martin space H. Any 

solution 7 of the Monge-Kantorovitch problem will give a solution to the Monge 
problem provided that its support is included in the graph of a map. 

Let us recall some notions of convexity on the Wiener space (cf. @,H^|)- Let 
K be a Hilbert space, a subset S of K x K is called cyclically monotone if 
any finite subset {{xi,yi), . . . , (xn, Un)} of S satisfies the following algebraic 
condition: 

(yi,X2 -x\) + (y2,x 3 - x 2 ) H h (y N -i,x N - x N -i) + (vn,xi -x n ) < 0, 

where (•,•} denotes the inner product of K. It turns out that S is cyclically 
monotone if and only if 



for any permutation a of {1, . . . ,iV} and for any finite subset {(xi, j/j) : i — 
1, . . . , TV} of S. Note that S is cyclically monotone if and only if any translate of 
it is cyclically monotone. By a theorem of Rockafellar, any cyclically monotone 
set is contained in the graph of the subdifferential of a convex function in the 
sense of convex analysis (|l9||) and even if the function may not be unique its 
subdifferential is unique. 

A measurable function defined on (W,H,fj) with values in IR U {oo} is called 
1-convex if the map 





(6.1) 




f(x + h) + -\h\l = F(x,h) 



is convex on the Cameron-Martin space H with values in L°(p). Note that 
this notion is compatible with the /i-equivalence classes of random variables 
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thanks to the Cameron-Martin theorem. It is proven in ||| that this definition 
is equivalent the following condition: Let (7r„,n > 1) be a sequence of regular, 
finite dimensional, orthogonal projections of H, increasing to the identity map 
Ih- Denote also by 7r n its continuous extension to W and define ir^ = Iw ~ it n . 
For x £ W, let x n — ir n x and x^ — ir„i. Then / is 1-convex if and only if 

x n 2^ Xu ^'h ~^ + X n) 
is Tr^/i-almost surely convex. 

Definition 6.1. Let £ and rj be two probabilities on (W,B(W)). We say that a 
probability 7 on (W X W,J3(W x W)) is a solution of the Monge-Kantorovitch 
problem associated to the couple (£, rj) if the first marginal of 7 is £, the second 
one is 77 and if 

Jil) = I \x- y^d-fix, y) = inf ( / \x - y\ 2 H d(3{x, y) : £ E($, 77) 

where rf) denotes the set of all the probability measures onWxW whose first 
and second marginals are respectively £ and rj. We shall denote the Wasserstein 
distance between £ and rj, which is the positive square-root of this infimum, with 
dnitv)- 

Remark: By the weak compactness of probability measures on W x W and the 
lower semi-continuity of the strictly convex cost function, the infimum in the 
definition is attained even if the functional J is identically infinity. 
The following result, whose proof is outlined below (cf. alsojflllicj) is an extension 
of an inequality due to Talagrand and it gives a sufficient condition for the 
Wasserstein distance to be finite: 

Theorem 6.1. Suppose that L is a positive random variable with LlogL £ 
and with E[L] = 1. Let v be the measure dv — Ldji, we then have 

d 2 H {y,n) <2E[L\ogL\. (6.2) 

Proof. Let us remark first that we can take W as the classical Wiener space 
W = Co([0, 1]) and, using the stopping techniques of the martingale theory, we 
may assume that L is upper and lower bounded almost surely. Then a classical 
result of the Ito calculus implies that L can be represented as an exponential 
martingale 



L t = cxp < — / u T dW T — — I \u T \ dr 



o 



with L — L\, Let us define u : W — > H as u(t, x) — L ii T dr and U : W — > W as 
U(x) = x + u(x). The Girsanov theorem implies that x — > U(x) is a Brownian 
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motion under v, hence /3 = (U x I)v G v). Let 7 be any optimal measure, 
then 

J( 7 ) = 4(^M)< / 
= 2£[ZlogL], 

where the last equality follows also from the Girsanov theorem and the Ito 
stochastic calculus. □ 
Combining Theorem 16. II with the triangle inequality for the Wasserstein dis- 
tance gives: 

Corollary 6.2. Assume that vi (i = 1, 2) have Radon- Nikodym densities Li (i = 
1,2) with respect to the Wiener measure \x which satisfy E\Li(w) log Li(w)\ < 
00, i = 1,2. Then 

dn{vi: v-i) < 00 . 
Let us give a simple application of the above result 

Corollary 6.3. Assume that A 6 B(W) is any set of positive Wiener measure. 
Define the H -gauge function of A as 

q A {x) = M(\h\ H : h&{A-x)f\H). 

Then we have 
in other words 

KA)<exp> ^ 



Similarly if A and B are H -separated, i.e., if A e HB = 0, for some e > 0, where 
A £ — {x e W : q A (x) < s}, then 



and consequently 



n{A)p(B) <exp ( -- 



Remark 6.1. We already know that, from the — 1-law, qA is almost surely 
finite, besides it satisfies \q A (x + h) — q A (x)\ < \h\n, hence the hypothesis of 
Lemma \4 - 1\ are satisfied. Consequently i?[exp \q A ] < 00 for any A < 1/2 (cf. the 
Appendix B.8 of ]2A] and l2a l). In fact all these assertions can also be proved 
with the technique used below. 
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Proof. Let va be the measure denned by 

dv A = — ^—-IaoI/i . 
fi(A) 

Let be the solution of the Monge-Kantorovitch problem, it is easy to see that 
the support of -fA is included in W x A, hence 

\x - u\h > in£{\x -z\ H ■ z e A} = ^(x) , 

7^4-almost surely. This implies in particular that qa is almost surely finite. It 
follows now from the inequality H6.2|l 

E[q A ] < -21ogM(A) ! 

hence the proof of the first inequality follows. For the second let B = A c e and 
let 7a_b be the solution of the Monge-Kantorovitch problem corresponding to 
va, vb- Then we have from the Corollary 16. 21 

d 2 H {v A ,v B ) < -41ogM(>4)M(B). 

Besides the support of the measure 7,43 is in A x B, hence 7 J 4B-almost surely 
\x — v\h > £ and the proof follows. □ 

We call optimal every probability measure 4 7 on W x W such that J (7) < 00 
and that J (7) < J (8) for every other probability 6 having the same marginals 
as those of 7. We recall that a finite dimensional subspace F of W is called reg- 
ular if the corresponding projection is continuous. Similarly a finite dimensional 
projection of H is called regular if it has a continuous extension to W. 

The proof of the next theorem, for which we refer the reader to can be 
done by choosing a proper disintegration of any optimal measure in such a way 
that the elements of this disintegration are the solutions of finite dimensional 
Monge-Kantorovitch problems. The latter is proven with the help of the section- 
selection theorem 0, Q ■ 

The notion of spread measure is key to all the results about the measure 
transportation: 

Definition 6.2. A probability measure m on (W, B(W)) is called a spread mea- 
sure if there exists a sequence of finite dimensional regular projections (7r n ,n > 
1) converging to Ih such that the regular conditional probabilities m{ ■ \ir^ = 
cc„ ), (where ir^ = I\y — it n ) which are concentrated in the n-dimensional spaces 
TTn{W) + vanish on the sets of Hausdorff dimension n — 1 for 7r^(m)- almost 
all x„ and for any n > 1 . 

Remark 6.2. Clearly any measure absolutely continuous with respect to jj, is 
spread. 



4 In fact the results of this section are true for the positive measures of equal mass. 
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Theorem 6.4 (General Monge-Kantorovitch transportation). Suppose 
that p and v are two probability measures on W such that 

d H {p,v) < oo 

and that p is spread. Then there exists a unique solution of the Monge-Kantorovitch 
problem, denoted by 7 £ v) and 7 is supported by the graph of a Borel map 
T which is the solution of the Monge problem. T : W — > W is of the form 
T = I\y + £, where £ £ H almost surely. Besides we have 

d \{p, v ) = / \T(x) -x\%dj(x,y) 

JWxW 
= / \ T ( X ) ~ x \ 2 H d p( x ) i 

cmd, wzt/i i/ie notations o f Dcfinition \6. M for tt^; p- almost almost all x^ , the map 
u — > £(u + 2;^;) is cyclically monotone on {n^) -1 {%n} , w i/ie sense that 

N 

^2 (€( X n + U *)> Ul+1 ~ U% )h - 
i=l 

-K^p-almost surely, for any cyclic sequence {ui, ■ ■ ■ , ujy, ujv+i — ui} from n n (W). 
Finally, if v is also a spread measure then T is invertible, i.e, there exists 
S : W — ► W of the form S — Iw + V such that r\ £ H satisfies a similar 
cyclic monotononicity property as £ and that 

1 = l{(x,y)G Wx W:ToS(y) = y} 
= l{{x,y) 6 W x W : SoT(x) = x} . 

In particular we have 

d 2 H (p,v) = / \S(y)-y\ 2 H d'y(x,y) 

JWxW 

= / \S{y) - y\ 2 H dv{y) . 
Jw 

The case where one of the measures is the Wiener measure and the other is 
absolutely continuous with respect to p is the most important one for the ap- 
plications. Consequently we give the related results separately in the following 
theorem where the tools of the Malliavin calculus give more information about 
the maps £ and r\ of Theorem 16. 41 

Theorem 6.5 (Gaussian case). Let v be the measure dv — Ldp, where L 
is a positive random variable, with E[L] = 1. Assume that dnipjv) < 00 (for 
instance L £ ILloglLj. Then there exists a 1- convex function (j> £ ©2,1 > unique 
up to a constant, such that the map T = I\y + V</> is the unique solution of the 
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original problem of Monge. Moreover, its graph supports the unique solution of 
the Monge-Kantorovitch problem 7. Consequently 

(I w x T)n = 7 

In particular T maps /i to v and T is almost surely invertible, i.e., there exists 
some T~ l such that T v = /i and that 

1 = [i \x : T -1 o T(x) = x} 

= v{yeW: ToT-\y)=y} . 

Remark 6.3. Assume that the operator V is closable with respect to v , then 
we have r] = SJip. In particular, if v and [i are equivalent, then we have 



iw 



where is ip is a I -convex function. 

Let us give some applications of the above theorem to the factorization of 
the absolutely continuous transformations of the Wiener measure. 

Assume that V = Iw + v '■ W — > W be an absolutely continuous transforma- 
tion and let L 6 1L^(/j,) be the Radon-Nikodym derivative of V fi with respect 
to ji. Let T = Iw + V</> be the transport map such that T/i = L.^l. Then it is 
easy to see that the map s = T _1 o V is a rotation, i.e., s/i = fx (cf. j2^|) and it 
can be represented as s = Iw + ex. In particular we have 



V0 o S = V . 



(6.3) 



Since <fi is a 1-convex map, we have h — > + 4>(x + h) is almost surely 

convex (cf.||). Let s' — Iw + be another rotation with a' : W — > H. By the 
1-convexity of <f>, we have 



■OS > — a 



H 



(a + V</> o s, a' 



H 



/z-almost surely. Taking the expectation of both sides, using the fact that s and 
s' preserve the Wiener measure [i and the identity (|6.3|l . we obtain 



E 



(v,a) H 



< E 



(v,a')u 



Hence we have proven the existence part of the following 

Proposition 6.6. Let H2 denote the subset 0/ L 2 (/x, H) whose elements are 
defined by the property that x — > x + rj{x) is a rotation, i.e., it preserves the 
Wiener measure. Then a is the unique element of IZ2 which minimizes the 
functional 

"1, 



T) -> M v {rj) = E 



- 2 n\ 
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Proof. The only claim to prove is the uniqueness and it follows easily from 
Theorem 1531 □ 
The following theorem, whose proof is rather easy, gives a better understand- 
ing of the structure of absolutely continuous transformations of the Wiener 
measure: 

Theorem 6.7. Assume that U : W — > W be a measurable map and L £ 
ILloglL(/i) a positive random variable with E[L] — 1. Assume that the measure 
v = L ■ p is a Girsanov measure for U , i.e., that one has 

E[foUL] = E[f], 

for any f £ Cb(W). Then there exists a unique map T — Iw + with <f> £ ©2,1 
is l-convex, and a measure preserving transformation R : W —> W such that 
U o T — R p-almost surely and U = R a T~ l v -almost surely. 

Another version of Theorem 16 . 71 can be announced as follows: 

Theorem 6.8. Assume that Z : W — > W is a measurable map such that Z\x <C 
[l, with dft{Z [a, fi) < 00. Then Z can be decomposed as 

Z = T o s , 

where T is the unique transport map of the Monge-Kantorovitch problem for 
E(/it, Z\i) and s is a rotation. 

Here is a more general version of the polar factorization: 

Theorem 6.9. Assume that p and v are spread measures with dH{p,v) < 00 
and that Up — v, for some measurable map U : W — > W. Let T be the optimal 
transport map sending p to v, whose existence and uniqueness is proven in 
Theorem \6.J\ Then R = T _1 o U is a p-rotation (i.e., Rp = p) and U — T o R, 
moreover, if U is a perturbation of identity, then R is also a perturbation of 
identity. In both cases, R is the p-almost everywhere unique minimal p-rotation 
in the sense that 

\U(x) - R{x)\ 2 H dp{x) = inf / \U(x) - R\x)\ 2 H dp{x) , (6.4) 
where 1Z denotes the set of p-rotations. 

Proof. Let T be the optimal transportation of p to v whose existence and unique- 
ness follows from Theorem l6.4l The unique solution 7 of the Monge-Kantorovitch 
problem for E(p, v) can be written as 7 = (I x T)p. Since v is spread, T is in- 
vertible on the support of v and we have also 7 = (T _1 x T)v. In particular 
Rp = T^ 1 o Up = T~ l v = p, hence R is a rotation. Let R' be another rotation 



it 
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in 1Z, define 7' = (R' x U)p, then 7' s £(p, f ) and the optimality of 7 implies 
that 7(7) < J (7'), besides we have 

|Z7(a:) -E(aO|ijdp(aO = / |C/(x) - T -1 o U(x)\ 2 H dp(x) 
11 iff 

|x-T -1 (x)|^di/(a:) 
|T(z) - z|^dp(:r) 

= At). 

On the other hand 

J( 7 ')= / \U(x)-R'(x)\ 2 H dp(x), 
Jw 

hence the relation (|6.4(l follows. Assume now that for the second rotation R' S TZ 
we have the equality 

\U(x) - R(x)\ 2 H dp(x) = I \U{x) - R'{x)\ 2 H dp{x) . 
w Jw 

Then we have J (7) = J{"f'), where 7' is defined above. By the uniqueness of the 
solution of Monge-Kantorovitch problem due to Theorem 16.41 we should have 
7 = 7'. Hence (R x U)p = (R' x U)p = 7, consequently, we have 

f(R(x),U(x))dp(x) = / f(R'(x),U(x))dp(x), 
w Jw 

for any bounded, measurable map /oaWx IV. This implies in particular 

(a o T o R) (b o f7)dp = /" (aoToR 1 ) (bo U)dp 
w Jw 

for any bounded measurable functions a and &. Let V = To R' , then the above 
expression reads as 

/ a o U b o V dp = / a o U' b o Udp . 
Jw Jw 

Taking a — &, we obtain 

(a oU)(ao U') dp = \\a o U\\ L , [p) \\a o U'\\ L 2 {p) , 

w 

for any bounded, measurable a. This implies that a o U = a o U' p-almost 
surely for any a, hence U = U' i.e., T o R = T o R' p- almost surely. Let us 
denote by S the left inverse of T whose existence follows from Theorem lfi.4l and 
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let D = {x G W : S o T(x) — x}. Since p(£>) = 1 and since R and R' are 
p-rotations, we have also 

p(Dn R~ 1 (D) n R'-\d)) = 1 . 

Let x G W be any element of D n R~ l (D) D R'^ 1 {D), then 

J?(x) = SoToR(x) 
= SoToR'(x) 

= R'(x), 

consequently R = R' on a set of full p- measure. □ 
Although the following result is a translation of the results of this section, it 
is interesting from the point of view of stochastic differential equations: 

Theorem 6.10. Let (W,p,H) be the standard Wiener space on JR d , i.e., W = 
C(1R+, TR d ). Assume that there exists a probability P -C [i which is the weak 
solution of the stochastic differential equation 

dy t = dW t + b(t, y)dt , 

such that dniPit 1 ) < oo. Then there exists a process (T t ,t G H+) which is a 
pathwise solution of some (anticipative) stochastic differential equation whose 
law is equal to P. 

Proof. Let T be the transport map constructed in Theorem 16 . 51 corresponding 
to dP/dfi. Then it has an inverse T" 1 such that /ijT^ 1 o T(x) — x} — 1. Let cf> 
be the 1-convex function such that T = Iyy + and denote by {D s 4>, s G M+) 
the representation of V0 in L 2 (M+,ds). Define T t (x) as the trajectory T(x) 
evaluated at t G 1R+. Then it is easy to see that (T t ,t G H+) satisfies the 
stochastic differential equation 

T t {x) = W t (x) + [ l(s, T(x))ds , t G IR+ , 
Jo 

where Wt(x) = x(t) and l(s,x) = D s (f>oT^ 1 (x) if x G T(W) and zero otherwise. 

□ 

Definition 6.3. Assume that p and v are two probability measures on {W, B{W)). 
We say that they are at locally finite Wasserstein distance from each other if 
there exists a measurable partition (Z ni n > 1) of W such that 

dnip-, v n ) < 00 ■ 

for any n > 1, where v n is the probability defined by v n (A) = i>{A n Z n )/v(Z n ). 

Remark 6.4. Any measure v which is absolutely continuous with respect to p 
is at locally finite distance from p. 
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Theorem 6.11. Assume that p and v are at locally finite distance from each 
other and that p is a spread measure. Then there exists a map T : W — > W, 
of the form T = Iw + £ (i.e., perturbation of identity) such that T maps the 
measure p to the measure v. 

Proof. Let (Z n ,n > 1) be a partition of W such that du(p,fn) < oo for each 
n € IN. Let (M n , n > 1) be a partition of W such that p(M n ) — v(Z n ) for each 
n £ IN. Define t n as the map which maps the measure 1 m„ P to the measure 
v(Z n ) p. Let also U n be the ^/-cyclically monotone map whose existence has 
been proven in [T^j, which maps v(Z n )p to the measure lz n v. Define T on M n 
as the map T n = U n o t n . Then it is easy to see that Tp = v. 



Question: We do not know in general that given two measures one of which 
being spread, if there is just one perturbation of identity which is cyclically 
monotone which maps the spread one to the second measure. Even in the case 
where the first measure is p and the second one is dv = Ldp where L is positive 
and E[L] — 1 this question is unanswered. In finite dimensional case the answer 
to this question is positive; in the infinite dimensional case the difficulty is due 
to the fact that the Cameron-Martin norm on W is only lower semi-continuous, 
hence, the weak limit of measures of cyclically monotone support on W X W do 
not have necessarily cyclically monotone support. 

7. The realization by a "signal and independent noise" 
7. 1 . Introduction 

Let (W, H, p) be a standard one-dimensional Wiener space and let (0,.F, p) 
be a probability space unrelated to (W,H,p). Let (W x Q,B(W) V T,p x p) 
be the corresponding product space. Let (u(9),6 G 8) be a random variable 
on taking values in the Cameron-Martin space H, i.e. u = J Q u s (0) ds. Let 
y = u(9) + w and as specified above w and u are independent. We will use 
Py\u to denote the conditional measure of y conditioned on the sub sigma field 
generated by u; also, pu\Y will denote the conditional measure of u conditioned 
on the subsigma field generated by y. Then 





n 





□ 
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and 

The problem considered in this section is the characterization of those non- 
negative random variables on (W,H,n), say L(w), where E[L] = 1 which are 
realizable by the model of equation (|7.1(l . We will assume throughout this section 
that E[L 2 } < oo. 



7. 2. Preliminaries 

Since E[L 2 } < oo, we have by the Wiener-Ito representation 

oo 

LM = l + £>(/ n ) 
n=l 

where (/ n (ii, • • • ,t n ),n > 1), are real valued and symmetric functions on [0, l] r 
^™(/ n ) is the multiple Wiener-Ito integral 

f(tx, ■■■ ,t„) dw(tx) . . . dw(t n ) 

[0,1]" 



and 

oo 

'lL2([o,i]») 



£[L 2 ] = l+5>!||/ n | |2 



A random variable F(w) will be said to belong to H\ for some real A > 1 if 
E[F 2 ] < oo and E[F%) < oo where 

oo 

1 

For < A < 1, H\ is defined as the dual to Hi/\ (relative to L 2 (fi)) and in this 
case the elements of H\ are generalized Wiener functionals. Set 



Hq = [J H\ ; Hoc = P| ■ 

A>0 A>1 

Then, for A > 1 

HcoCH A cHi = L 2 (W) c Hi /a C H . 

An element F € Ho will be said to be positive and denoted F > if (-F, G) > 
for every element G of 7ioo and satisfying G > 0. For F G 7ii this means 
that F(w) is /i-a.s. non negative. Moreover, let P a , a > 0, denote the Ornstein- 
Uhlenbeck semigroup 



P a F{w) — E F (e~ a w + VI - e- 2a {w)) \B(W) 
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where w is an independent copy of the Brownian motion w living on (W, B(W)). 
Therefore if F = jr 5 n (f n ), then 

P a F = J2e- na 5 n (f n ) 

and since P a preserves positivity, if F G Ho and F\ > then Fp > for all (3 
satisfying < (3 < A. In fact P a is even positivity improving in the sense that 
if F G L 1 ^) is a.s. non-negative then P a F > almost surely. 



7. 3. The positivity of random variables 

Let F be in Ho, define A pos (F), the positivity index of F, as the supremum of 

all A > such that Fa > and \ pos {F) = if this set is empty. Hence F\ > 

for all A G (0, A pos (F)). 

In order to continue we recall the following two definitions: 

(a) Let 4>{y) be a complex- valued on y G H . <fi(y) is said to be positive 

definite if for all n, all yi G H , d G C, i = 1, 2, . . . n 

n 

(b) Let F E Ho and for any h E H set = cxp(i5h + ^\h\%). The modified 
r transform of F, tf, is defined by 

oo 

f F (ft) - (F , e(/i)) = ]T*" </n.^ n ) ff8B - (7-2) 

71 = 

It follows directly from the last equation that 

f Fx (h) = f F (Xh) . 

The following is a key result for the characterization of the realization problem 
in the next subsection. 

Theorem 7.1. Let F G Ho, then F > if and only if h — > tf(/i) exp — \\h\ 2 H 
is positive definite on H. 

Proof. The necessity is straightforward: Suppose that F > 0. Then for any 
{hi, ■ ■ ■ , h n } G H , {ci, • • ■ , c„} C C and n > 1, we have 







n 


2" 



















(7.3) 
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where G 



En . 
.7 = 1 e " 



-XShj 



> and G G Hoo = n x Hx; therefore (F, G) > 
is positive definite. For the converse direction 



and hence h — > tf{K) exp — ^|, i 

cf. or chapter 5 of [3| and the references therein. □ 
Remark: It is easy to verify from the last theorem that F G Hq is strongly 
positive if and only if TpQi) is positive definite. 



7-4- IVie realization by a "signal and independent noise" model 

Let (0, J-, p) be as defined earlier, let u be an H- valued random variable on this 
space. Further assume that for all n > 1 ii, . . . ,t n C [0, 1], E[u(ti)u(t2) • ■ • 
exists as a symmetric square integrable symmetric function f n {t\, ■ ■ • , t n ) G 
L 2 ([0, 1]™). The collection of all such u defined on some probability spaces 
{®,Fe,p) will be denoted by Uh- 

Definition 7.1. A generalized junctional F £ H.q will be said to be induced by 
u, if there exists a probability space (O,^ 7 , p) and an H valued r.v. u defined on 
it such that 

fn(h, ■ ■■ ,t„) = —:E[u(ti)u(t2) • ■ ' u(t n )} 
n .1 



other words 



1 



f n =-E[u® n ] 



for all n > 1 . Now, if F G L 2 (p) is induced by some u G Uh and 



then, setting 
yields 



Y(w,6) = exp ( Su- -\u\ 2 H 



F{w) = E 



exp Su 



Fw 



= ^E[S n u® n \F w ] 

oo 

J2d n (E P [u® n ]) 



71=0 



We can now state the following characterization of the realizability of Radon- 
Nikodym derivatives by a "signal plus independent noise" model. 
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Theorem 7.2. Let F G H with F = ££L and ( F > X ) = 1 then there 

exists a cylindrical probability measure p on H such that for all hi, ■ ■ ■ , h n G H, 
n>l 

(f n ,hi <g> ■ • ■ ® /i n ) = -r / {h,hi) H . . .(h,h n ) H p(dh) 



and f2(ti,t2) in of trace class on [0, l] 2 if and onfo/ i/-F zs realizable. 

Proof. If a probability measure exists, then cf. equation l|7.2|) and theorem 17. II 

~ oo 

/ exp i(ho, h) HP (dh) = 1 + J2 i n (f n , h® n ) = f(h) . 
Jh i 

Therefore fp is positive definite hence F is strongly positive. The converse 
follows by Theorem 7.1 and Theorem 2.1 of [12j since the trace-class property 
of fi implies the fact that the measure p is in fact sigma additive measure 
on H. This means the existence of an H- valued random variable u whose law 
is p. In fact, let (Q,F,p) be any probability space rich enough to induce a 
random variable u : <d — > H whose law is p. Define on the product space 
(W x 8, B(W) ® J 7 , fi x p) the map Y = u{6) + w, then clearly 

— — (w) = F(w) . 
dp 

□ 
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